Page content display with conditional scroll gesture snapping

ABSTRACT

A platform allows experts, for example home improvement professionals, to upload their portfolios, i.e. content such as photos, videos, text, and sound, to a publicly available resource, such as a Web browser accessible, network based commerce system. Users may then browse the content, for example by room, style, and metro area. The content is arranged in collections that are in part passively sorted, based upon user relevance. Tags are shown on objects, for example within photos, for which there is more information. In the case of a movable display device, the tags can simulate the physics of real tags, for example they can move back and forth when the device is shaken. In another embodiment, a snap point is set to impart either a scrolling transition or a step transition between display pages, based upon user scroll activity.

CROSS REFERENCE TO RELATED APPLICATIONS

This application is a continuation of, and claims priority to, U.S. patent application Ser. No. 13/472,409, filed May 15, 2012, now U.S. Pat. No. 9,230,223, which claims priority to provisional patent application Ser. No. 61/567,012, filed Dec. 5, 2011, each of which is incorporated herein in its entirety by this reference thereto.

BACKGROUND OF THE INVENTION

Technical Field

The invention relates to the interactive organization and presentation of content within a semi-supervised expert domain. More particularly, the invention relates to the consistent presentation of content and passive relevance determination of content relationship in an on-line commerce system.

Description of the Background Art

Electronic commerce, commonly known as e-commerce, eCommerce, or e-comm, refers to the buying and selling of products or services over electronic systems such as the Internet and other computer networks. However, the term may refer to more than just buying and selling products online. It also includes the entire online process of developing, marketing, selling, delivering, servicing, and paying for products and services.

The amount of trade conducted electronically has grown extraordinarily with widespread Internet usage. The use of commerce is conducted in this way, spurring and drawing on innovations in electronic funds transfer, supply chain management, Internet marketing, online transaction processing, electronic data interchange (EDI), inventory management systems, and automated data collection systems. Modern electronic commerce typically uses the World Wide Web at least at one point in the transaction's life-cycle, although it may encompass a wider range of technologies such as e-mail, mobile devices, and telephones as well.

A large percentage of electronic commerce is conducted entirely in electronic form for virtual items, such as access to premium content on a website, but mostly electronic commerce involves the transportation of physical items in some way. Online retailers are sometimes known as e-tailers and online retail is sometimes known as e-tail. Almost all big retailers are now electronically present on the World Wide Web.

Electronic commerce that takes place between businesses is referred to as business-to-business or B2B. B2B can be open to all interested parties, e.g. commodity exchange, or limited to specific, pre-qualified participants (private electronic market). Electronic commerce that takes place between businesses and consumers, on the other hand, is referred to as business-to-consumer or B2C. This is the type of electronic commerce conducted by companies such as Amazon.com. Online shopping is a form of electronic commerce where the buyer is directly online to the seller's computer, usually via the Internet. There is no intermediary service involved. The sale or purchase transaction is completed electronically and interactively in real-time, such as in Amazon.com for new books. However in some cases, an intermediary may be present in a sale or purchase transaction such as the transactions on eBay.com.

Online shopping has evolved to the point that a specific user experience is often provided for particular types of goods. Online shopping also includes recommendations, user reviews, and other social aspects of the goods listed on a particular site. Unfortunately, there is no way for a user to make much sense of the listed items because no thought goes into arranging and presenting the items in a meaningful way relative to each other across categories, nor is expert advice associated with the online sale of goods, nor is user interaction with the goods during their presentation considered in subsequent presentation of the goods. Further, most online commerce is presented in a format that belongs to an earlier era of the Internet, and that is not particularly well adapted for modern tablet and handheld devices.

SUMMARY OF THE INVENTION

The presently preferred embodiment of the invention comprises a platform that allows experts, in this embodiment home improvement professionals, to upload their portfolios, i.e. content such as photos, videos, text, and sound, to a publicly available resource, such as a Web browser accessible, network based commerce system. This content is uploaded by using self-service tools. Users may then browse the content, such as photos, by room, style, and metro area. Uniquely, the content, in this example the photos, is arranged in collections that are in part passively sorted, based upon user relevance.

Further, an embodiment of the invention presents tags on objects in photos for which there is, for example, more information, price information, and/or product information. In the case of a movable display device, the tags on the photo can simulate the physics of real tags, for example they can move back and forth when the device is shaken.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1a is a screen shot showing a photo ad according to the invention;

FIG. 1b is a further screen shot showing products listed in the photo ad of FIG. 1 a;

FIG. 2 is an example of product ads according to the invention;

FIG. 3 is a screen shot showing idea book recommendations according to the invention;

FIG. 4 is a screen shot showing the recommending of new items that a user has not seen before according to the invention;

FIG. 5a is a screen shot showing the suggesting of items related to an item that is being shown according to the invention;

FIG. 5b is a screen shot showing a Web site linked to from the photo in the screen shot of FIG. 5 a;

FIG. 6 is a screen shot of a photo category page, as shown on a handheld device, according to the invention;

FIG. 7 is a screen shot of a photo stream page, as shown on a handheld device, according to the invention;

FIG. 8 is a screen shot of a photo stream page for an individual photo, as shown on a handheld device, according to the invention;

FIG. 9 is a screen shot of a profile page, as shown on a handheld device, according to the invention;

FIGS. 10a and 10b are screen shots of a photo stream page for an individual photo that has animated tags, as shown on a handheld device, according to the invention;

FIGS. 10c and 10d are screen shots of the photo ad page linked to the animated tags of FIGS. 10a and 10a , as shown on a handheld device, according to the invention;

FIGS. 11a-11f are screen shots that show pages as displayed with snapping (FIGS. 11a and 11 b) and smart layout (FIG. 11 c) according to the invention;

FIG. 11d is a screen shot that shows scrolling without snapping between the pages shown in FIGS. 11a and 11 b;

FIG. 11e is a screen shot that shows scrolling without snapping between the page shown in FIGS. 11b and 11 c;

FIG. 11f is a screen shot that shows display of the page of FIG. 11c without the use of the smart layout feature of the invention; and

FIG. 12 is a block schematic diagram of a machine in the exemplary form of a computer system within which a set of instructions for causing the machine to perform any one of the foregoing methodologies may be executed.

DETAILED DESCRIPTION OF THE INVENTION

The presently preferred embodiment of the invention comprises a platform that allows experts, in this embodiment home improvement professionals, to upload their portfolios, i.e. content such as photos, videos, text, and sound, to a publicly available resource, such as a Web browser accessible, network based commerce system. This content is uploaded by using self-service tools. Users may then browse the content, such as photos, by room, style, and metro area. Uniquely, the content, in this example the photos, is arranged in collections that are in part passively sorted, based upon user relevance. This is accomplished in an embodiment by providing users with tools for creating idea books. The idea books are produced when a user selects various photos in various categories for inclusion in their idea books.

The system monitors the user selection process to determine relationships between items within photos in selected categories. The system collates this information across the many users of the system and, thus, passively determines user relevance. For example, in the embodiment concerning residential interior design, photos may be organized in categories that correspond to the different rooms of a house, such as the bedroom, living room, etc. A user may proceed at the room level, adding design elements, furniture, and the like to an idea book for a particular room, for example the living room. Here, there are many styles available to the user, such as classic, modern, European, Asian, and the like. The user's idea book selections are that user's opinion as to design elements that may coordinate with each other, even if they are selected from a variety of styles. The system makes note of these connections for each user and, in the process, identifies user-based groupings that may then lead to new categories and/or suggestions to other users.

Further, this process may be applied across rooms, as well as styles. In this way, user idea book selections across a user population are used to identify trends and tastes. This information can be used to create recommendations, to reorganize content within categories, to create new categories, and to alert design experts and vendors with regard to user preferences.

In addition to passive determination of user relevance, an embodiment of the invention also allows user interaction with the content. For example, users may tag objects in photos and this information can be processed, as above for passive interactions, to organize and reorganize the presentation of content.

Photos may also be tagged editorially with links to information about the tagged objects. For example, a tag may provide a link to an expert discussion of the tagged object or to an expert in connection with the expert's design services, a link to other objects that are related aesthetically to the object or that are thematically consistent with the object, and/or a link to a distributor or manufacturer Website to facilitate commerce in connection with the tagged object and/or related objects. Further, the user may link to additional views of the tagged object, obtain options, select different coverings for the object, such as a different type of upholstery, which is then displayed in the photo, and save these objects in the user's idea book.

Thus, another unique aspect of the invention is the use of photo-based advertising. In contrast to banner ads, sidebars ads, pop-ads, and the other that populate the Web, the invention herein provides collections of photos that are related categorically and that are determined to be relevant by any or all of passive user interaction, user tagging, and expert supervised tagging. In this embodiment of the invention, all communication is image based. Rather than have ads that pop out and grab a user's attention, often annoyingly, an embodiment of the invention presents all photos, for example, in a collection in a consistent format.

Tags

The photos in a category or grouping are of the same size and shape or similar in size and shape, the objects in the photos are all thematically and/or aesthetically related to all other photos in the category or grouping, the use of icons or tags to indicate links to additional information is consistent. Thus, the user is presented with a ‘walled garden’ in which the substance of each photo is the focus of the presentation, and not the use of annoying or distracting attention getting devices. In this way, each ad itself looks like each other ad in a photo stream and the user may thus direct his attention to the perceived merits of the objects within the photos, and not the format of presentation of the photos.

Due to the relevance determination, filtering to a user query is precise. If a user wants to view photos related to, for example, dens, and the user is interested in French contemporary styling, then the photo stream presented to the user is constrained to relevant content, the content is consistently presented to allow the user to concentrate on the design elements within the photos and not on the presentation of the photos, and thus make a more natural and meaningful decision when selecting objects for inclusion in the user's idea book, and the user is afforded the opportunity to link to additional information upon selecting tags within a particular photo.

In an embodiment, users are able to ask professionals questions about the photos. Both users and an editorial team can tag the photos with products. Users can save photos to collections. Users can also browse a visual directory of professional service providers.

The tags presented to the user may be of various types. For example, the shape of the tag and/or color may be used to denote different classes of information. Thus, a dollar sign shaped tag may denote a link for purposes of commerce, a question mark may denote a link for expert advice, and the like. Further, it is clear that hand held devices, such as the iPad, are becoming increasingly popular and such devices are ideally suited for use in connection with the invention herein disclosed. For example, an embodiment of the invention provides motion sensitive tags that are activated when the display device, e.g. the iPad, is moved in a particular fashion, such as by shaking. Such action can make the tags appear, make the tags disappear, animate the tags, for example where the tags move back and forth to draw attention to them. In this and other embodiments, the user may set preferences with regard to the tags, for example, if animation is desired, if particular classes of tag are to be excluded or included, and if tag selection executes a particular action, such as adding a reference to the user's idea book, or generating a purchase inquiry, and/or if tags should not be displayed at all, i.e. they may be turned off.

Thus, an embodiment of the invention presents tags on objects in photos for which there is, for example, more information, price information, and/or product information. In the case of a movable display device, the tags on the photo can simulate the physics of real tags, for example they can move back and forth when the device is shaken (see FIGS. 11a and 11b , discussed below). In such case, the tags swing around a rotation point using pendulum equations of motion, or approximate pendulum equations of motions which are more computationally efficient on small devices, such as smartphones. In an embodiment, an approximate pendulum equation of motion is: thera″=−k theta−f theta′  (1)

where:

-   -   theta is the angle between the tag image orientation on the         gravity vector, either the true gravity vector derived from the         device sensors, or the apparent gravity direction derived from         the image orientation;     -   theta′ is the angular velocity;     -   theta″ is the angular acceleration;     -   k represents a pendulum spring coefficient; and     -   f represents a damping coefficient.

In an embodiment, the gravity direction in the image plane may be computed using the device's accelerometers. The gravity direction may be the gravity vector components in the image plane, or the down direction depicted in the image if the gravity has no component in the image plane, i.e. gravity is perpendicular to the image plane. The user interacts with the tags by swinging them or tapping them. By using the device's accelerometers and an appropriate band pass filter within the device's software, a user shaking the device introduces acceleration elements and motion to the tags as if the tags where a pendulum that is being pushed/swung. Thus, the tags appear to swing side to side and/or in and out of the plane of the display. This same effect may be achieved through any other user action, such as using a sliding gesture to flip from one page to another on the display. Other embodiments of the invention provide tags that mimic such real world phenomena, for example, such as springs, e.g. tags that bounce when the device is shaken, spinners, e.g. tags that spin when the device is rotated, and animation including starbursts, explosions, avatars, and the like.

Photo Ads

A common activity for those who use the invention is browsing photo streams. Users can filter photos by style (modern, traditional), space (kitchen, bedroom), and/or specify search terms (red sofa). In this case user is looking at modern family room photos. The photo ad unit is designed to be injected seamlessly into this stream. An example is shown in FIGS. 1a and 1 b.

A photo ad has several components:

1. Promotional section contain information about the advertiser;

2. Photo; and

3. Products that are included in the photo.

Clicking on the photo leads to a page hosted by, for example houzz.com that includes more information about the products on the photo (FIG. 1b ). Users can click on “buy” button to purchase products on advertiser's Web site.

To make a photo ad appear seamless in the photo stream, it is important to select ads that are contextually relevant. This is achieved by the following:

1. Before a photo ad is run, the editorial team enter meta information about the photo, including:

-   -   Style (modern, traditional)     -   Space (kitchen, bedroom)     -   Title     -   Description     -   Information about each product

2. The system ad server uses contextual information to match meta information in photo ad, including:

-   -   Style (modern, traditional) of the current photo stream     -   Space (kitchen, bedroom) of the current photo stream     -   Search terms (red sofa)     -   Meta information of the photos in stream, including style,         space, title, description, image tags, and product tags. The         meta information includes both editorial and community         contributed content.     -   User's past activities, e.g. on Houzz.com, including style,         space, search terms used before, and photos viewed. In the         example shown, a matching photo ad is selected because user is         browsing modern family room photos.

Product Ads

In this ad unit, a Web site such as Houzz.com, for example, shows contextually relevant products associated with photos. An example is shown in FIG. 2.

Similar to photo ads, the ad server uses meta information in photos (both editorial and community contributed content) to find appropriate Product Ads).

Recommendations

In an embodiment, users can add photos/products to their idea books. When they are doing this, they are making implicit associations between items (photos/products) in their collections (idea books) (see FIG. 3):

1. Items in same collection share some common attributes.

2. Items added by same user share some common attributes.

An embodiment applies collaborative filtering algorithms to find similar items. For each item, there is a set of users who have added it to one or more idea books. Given any two items, the system derives a similarity score based on the intersection of the two user sets. The system can enhance the score with a similar calculation on the collections that are associated with the items. Calculating similarity scores on every pair of items produces an item similarity matrix that can be used in different scenarios, e.g.:

1. Recommending new items that the user has not seen before (see FIG. 4);

2. Suggesting items related to the item that is being shown (see FIG. 5a , which shows a recommendation and FIG. 5b , which shows a product linked to a tag in the photo on the screen shot of FIG. 5a ).

Because all of the implicit associations are analyzed, the system can produce interesting results, including:

1. Related items in different categories, for example, products related to photos;

2. Identify new groups of items that strongly related to each other.

Exemplary Embodiment in a Handheld Device

FIG. 6 is a screen shot of a photo category page, as shown on a handheld device, according to the invention. In this embodiment, various home design related categories are shown. Other embodiments of the invention may provide appropriate categories for other domains, for example automotive, gardening, etc.

FIG. 7 is a screen shot of a photo stream page, as shown on a handheld device, according to the invention. Once a category is elected, a photo stream is presented. Here the user is shown two streams, i.e. Metro Area and Style. Each of the streams may be further refined by user selection of appropriate pull down menus.

FIG. 8 is a screen shot of a photo stream page for an individual photo, as shown on a handheld device, according to the invention. This photo is selected from the photo stream of FIG. 7. Here, the user is given the option to add this photo to an idea book. The system monitors such selections, as discussed above, to refine system categorization and recommendations. The user is also provided with the option to seek information from a professional, such as the professional who prepared this photo in the photo stream as part of the professional's portfolio.

FIG. 9 is a screen shot of a profile page, as shown on a handheld device, according to the invention. If the user selected information about the professional in FIG. 8, then this information is presented to the user. The user is then presented with links to the professional's photo stream, projects, and other information.

FIGS. 10a and 10b are screen shots of a photo stream page for an individual photo that has animated tags, as shown on a handheld device, according to the invention. Here, a photo of a bedroom is shown. the photo may be added to the user's idea book. The user may also turn on tags within the photo. In this example, a fan is tagged. The tag is animated to draw attention to the tagged object. In a photo having many objects, especially on a handheld device having a small display, the animated tag draws attention to objects for which additional information is available. As shown in FIG. 10a the tag has tilted to the left in response to user movement of the device. the tag swings back and forth and tilts to the right, as shown in FIG. 10 b.

FIGS. 10c and 10d are screen shots of the photo ad page linked to the animated tags of FIGS. 10a and 10a , as shown on a handheld device, according to the invention. In FIG. 10c , the user has selected the tag for the fan of FIGS. 10a and 10b and a more detailed photo of the fan is presented. The user is also provided with links to the product vendor and an on line store if a purchase of the product is desired. In FIG. 10d , the user has selected the vendor option and more information is provided about the product.

Combined Presentation and Reading Mode Information Display

Presentation Mode vs. Reading Mode

When displaying information, such as articles, on an electronic device two distinct modes can be used.

A presentation mode typically consists of slides. Each slide contains some information and stands out on its own. One moves through the presentation by moving from slide to slide. The navigation is made simple by not needing to position each slide when reading; rather each slide occupies the entire display and is formatted to fit well in the display. A simple gesture, such as a button press or finger swipe, navigates the user to the next slide. In presentation mode there is no continuity, each slide is presented, typically occupying the entire display, and is whole in terms of content and self standing.

In a reading mode, that is typically employed by Web sites presented in a Web browser. Continuous information is displayed and the user can navigating by continuously scrolling down the display. Continuous scrolling provides continuity in the information presentation, the user never loses context of what was before what is on the screen and can see what is next. This display mode, however, lacks the visual impact of a slide mode. That is, the information can not be accurately positioned and laid out on the screen because it continuously scrolls. It also lacks the simplicity of a discrete event to move from slide to slide. Rather, the user is left with the burden of positioning the information on screen.

Accordingly, an embodiment of the invention provides a way to get the best of both worlds, a slide mode in a continuous scroll reading display. This is achieved in two steps:

1. A layout step identifies information units. These are groups of information items that relate to each other and which would be logical to display together. These information units are then laid out in such a way as to occupy screen full units as much as possible.

2. A presentation step which consists of a scrolling display and a smart snapping of the scroll position to logical location in the information unit display. In this way, the presentations appears as if the device is guessing what unit the user is looking at and positioning that information unit optimally on the screen.

Information Unit Identification

An information unit consists of items in the underlying text that can be grouped together, such as an image and its caption, paragraphs of text in the same text section or subsection, a title and its sub-title, author info, etc.

Information Unit Layout

When deciding on the layout of the unit, the screen size is brought into consideration. Some elements in the unit, such as photos, can typically be scaled and presented at several sizes. Other units, such as text, tend to have more of a fixed size, but their size can be controlled indirectly by setting the text column width, spacing between columns, spacing between paragraphs, etc.

When a unit has a photo and text, if the photo can be reduced in size such that the entire text that relates to it fits the screen, together with the image, the image is scaled down. The image is only scaled down up to a minimum presentation size. If the minimum size is not sufficient so that the text and image fit the screen together, the image is presented in the maximum scaling. This makes the unit fit an entire screen if possible, thus preserving an appealing visual impact by not down scaling images too much, and only scaling them down if by doing so the added benefit of having the text associated with it be visible in its entirety on-screen.

Presentation Snapping

When the user scrolls, a physics engine is used to move the screen and provide more realistic feedback. Using a physics engine in scrolling is common practice today and was pioneered by Apple in the iOS operating system on the iPhone. Usage of the physics system enables setting snap point, i.e. modifying the calculated target end points of the physics engine to more meaningful positions in the information flow, thus providing a natural snap feeling.

The target position of the scroll rest is set to the start of information unit, thus when the user flicks his finger on the screen, the scroll stops with an information unit at the top of the screen. Due to the layout the information unit fills the screen if possible. When the user continuously drags the scroll view, it is positioned where the user moved it to, providing a continuous scroll experience similar to regular scrolling presentations. No snapping is performed in this case.

Distinguishing between when to do snapping and when not to do snapping provides both the presentation feel, positioning the information feed at exact information units on the screen, and the continuous reading/scrolling feel enabling the user to position the information feed where he wants. When an information unit is too large to fit an entire screen, snapping positions inside the information unit are used, based on the structure of the information unit.

Examples are:

-   -   Top of a photo;     -   Top of a text description; and     -   Any break in a text description, such as sections, paragraphs,         etc.

These in-information unit breaks are not used when the information unit fits inside the screen. This gives a slide feeling to the information presentation.

FIGS. 11a-11f are screen shots that show pages as displayed with snapping (FIGS. 11a and 11b ) and smart layout (FIG. 11c ) according to the invention. It can be seen that snapping provides a coherent and graceful transition between displays unit, i.e. pages, while smart layout provides a more proportional and balanced presentation of images and text relative to each other.

In contrast, FIG. 11d is a screen shot that shows scrolling without snapping between the pages shown in FIGS. 11a and 11b ; FIG. 11e is a screen shot that shows scrolling without snapping between the page shown in FIGS. 11b and 11c ; and FIG. 11f is a screen shot that shows display of the page of FIG. 11 c without the smart layout. As can be seen, page transitions are awkward and nonsensical without the use of snapping, and here is no way to fit both the photo and its caption/referring text in the same screen without the use of smart layout.

Computer Implementation

FIG. 12 is a block schematic diagram of a machine in the exemplary form of a computer system 1600 within which a set of instructions for causing the machine to perform any one of the foregoing methodologies may be executed. In alternative embodiments, the machine may comprise or include a network router, a network switch, a network bridge, personal digital assistant (PDA), a cellular telephone, a Web appliance or any machine capable of executing or transmitting a sequence of instructions that specify actions to be taken.

The computer system 1600 includes a processor 1602, a main memory 1604 and a static memory 1606, which communicate with each other via a bus 1608. The computer system 1600 may further include a display unit 1610, for example, a liquid crystal display (LCD) or a cathode ray tube (CRT). The computer system 1600 also includes an alphanumeric input device 1612, for example, a keyboard; a cursor control device 1614, for example, a mouse; a disk drive unit 1616, a signal generation device 1618, for example, a speaker, and a network interface device 1628.

The disk drive unit 1616 includes a machine-readable medium 1624 on which is stored a set of executable instructions, i.e., software, 1626 embodying any one, or all, of the methodologies described herein below. The software 1626 is also shown to reside, completely or at least partially, within the main memory 1604 and/or within the processor 1602. The software 1626 may further be transmitted or received over a network 1630 by means of a network interface device 1628.

In contrast to the system 1600 discussed above, a different embodiment uses logic circuitry instead of computer-executed instructions to implement processing entities. Depending upon the particular requirements of the application in the areas of speed, expense, tooling costs, and the like, this logic may be implemented by constructing an application-specific integrated circuit (ASIC) having thousands of tiny integrated transistors. Such an ASIC may be implemented with complementary metal oxide semiconductor (CMOS), transistor-transistor logic (TTL), very large systems integration (VLSI), or another suitable construction. Other alternatives include a digital signal processing chip (DSP), discrete circuitry (such as resistors, capacitors, diodes, inductors, and transistors), field programmable gate array (FPGA), programmable logic array (PLA), programmable logic device (PLD), and the like.

It is to be understood that embodiments may be used as or to support software programs or software modules executed upon some form of processing core (such as the CPU of a computer) or otherwise implemented or realized upon or within a machine or computer readable medium. A machine-readable medium includes any mechanism for storing or transmitting information in a form readable by a machine, e.g., a computer. For example, a machine readable medium includes read-only memory (ROM); random access memory (RAM); magnetic disk storage media; optical storage media; flash memory devices; electrical, optical, acoustical or other form of propagated signals, for example, carrier waves, infrared signals, digital signals, etc.; or any other type of media suitable for storing or transmitting information.

Although the invention is described herein with reference to the preferred embodiment, one skilled in the art will readily appreciate that other applications may be substituted for those set forth herein without departing from the spirit and scope of the present invention. Accordingly, the invention should only be limited by the Claims included below. 

The invention claimed is:
 1. A system comprising: one or more processors of a machine; a display device having a screen size: a memory storing instructions that, when executed by the one or more processors, cause the machine to perform operations comprising: identifying the screen size of the display device; identifying a multi-page electronic document comprising a plurality of pages navigable using a continuous scroll, the plurality of pages arranged vertically such that the continuous scroll scrolls the plurality of pages in a vertical direction with respect to the display device, each of the pages comprising an image and text that describes the image; generating one or more scaled pages and one or more non-scaled pages from the plurality of pages by generating a scaled page or a non-scaled page from each of the plurality of pages based on whether the page is entirely displayable within the screen size by reducing an image size of the image above to a minimum scale size while keeping a text size of the text constant, wherein each scaled page is generated from a respective page of the plurality of pages by reducing an image size of the image of the page to the minimum scale size while keeping a text size of the text of the page constant so that both the image and the text are entirely displayable together within the screen size, and wherein each non-scaled page is generated from a respective page of the plurality of pages in response to determining that the image of the page and the text of the page cannot be entirely displayed together within the screen size if the image is reduced to the minimum scale size, the non-scaled page comprising the image presented at a maximum scaling and the text; generating one or more scrolls stops for each of the one or more scaled pages and each of the one or more non-scaled pages at locations based on whether the page is a scaled page or a non-scaled page, wherein each scaled page comprises an edge scroll stop on an edge side of the scaled page and each non-scaled page comprises one or more inner scroll stops at text borders or image borders within the non-scaled page; receiving, from a user through the display device, a first gesture to scroll the multi-page electronic document to a further page, the first gesture to scroll being a flick gesture; in response to the first gesture being the flick gesture to scroll and the further page being a scaled page, snapping to the edge scroll stop of the scaled page such that the scaled page is entirely displayed within the screen size; receiving, from the user through the display device, a second gesture to scroll the multi-page electronic document to another further page, the second gesture to scroll also being the flick gesture to scroll; in response to the second gesture being the flick gesture to scroll and the another further page being a non-scaled page, snapping to one or more inner page scroll stops within the non-scaled page such that the non-scaled page is not completely displayed within the screen size of the display device; receiving, from the user through the display device, a third gesture to scroll the multi-page electronic document to an additional further page, the third gesture to scroll being a drag gesture to scroll; and in response to the third gesture being a drag gesture to scroll, continuously scrolling to the additional further page without snapping.
 2. The system of claim 1, wherein the edge scroll stop for each scaled page is a top side of the scaled page.
 3. The system of claim 1, wherein the another further page comprises an inner page scroll stop at a top side of an image and another inner page scroll stop at a top side of text.
 4. The system of claim 1, wherein the display device is a touch screen.
 5. A method comprising: identifying a screen size of a display device; identifying a multi-page electronic document comprising a plurality of pages navigable using a continuous scroll, the plurality of pages arranged vertically such that the continuous scroll scrolls the plurality of pages in a vertical direction with respect to the display device, each of the pages comprising an image and text that describes the image; generating one or more scaled pages and one or more non-scaled pages from the plurality of pages by generating a scaled page or a non-scaled page from each of the plurality of pages based on whether the page is entirely displayable within the screen size by reducing an image size of the image to a minimum scale size while keeping a text size of the text constant, wherein each scaled page is generated from a respective page of the plurality of pages by reducing an image size of the image of the page to the minimum scale size while keeping a text size of the text of the page constant so that both the image and the text are entirely displayable together within the screen size, and wherein each non-scaled page is generated from a respective page of the plurality of pages in response to determining that the image of the page and the text of the page cannot be entirely displayed together within the screen size if the image is reduced to the minimum scale size, the non-scaled page comprising the image presented at a maximum scaling and the text; generating one or more scrolls stops for each of the one or more scaled pages and each of the one or more non-scaled pages at locations based on whether the page is a scaled page or a non-scaled page, wherein each scaled page comprises an edge scroll stop on an edge side of the scaled page and each non-scaled page comprises one or more inner scroll stops at text borders or image borders within the non-scaled page; receiving, from a user through the display device, a first gesture to scroll the multi-page electronic document to a further page, the first gesture to scroll being a flick gesture; in response to the first gesture being the flick gesture to scroll and the further page being a scaled page, snapping to the edge scroll stop of the scaled page such that the scaled page is entirely displayed within the screen size; receiving, from the user through the display device, a second gesture to scroll the multi-page electronic document to another further page, the second gesture to scroll also being the flick gesture to scroll; in response to the second gesture being the flick gesture to scroll and the another further page being a non-scaled page, snapping to one or more inner page scroll stops within the non-scaled page such that the non-scaled page is not completely displayed within the screen size of the display device; receiving, from the user through the display device, a third gesture to scroll the multi-page electronic document to an additional further page, the third gesture to scroll being a drag gesture to scroll; and in response to the third gesture being a drag gesture to scroll, continuously scrolling to the additional further page without snapping.
 6. The method of claim 5, wherein the edge scroll stop for each scaled page is a top side of the scaled page.
 7. The method of claim 5, wherein the another further page comprises an inner page scroll stop at a top side of an image and another inner page scroll stop at a top side of text.
 8. The method of claim 7, wherein in response to the second gesture being the flick gesture and the another further page being the non-scaled page, the inner page scroll stop at the top side of the image is snapped to such that a portion of the image or a portion of the text is not visible within the screen size.
 9. The method of claim 8, further comprising: receiving an additional flick gesture after snapping to the inner page scroll stop at the top of the image; and snapping to the another inner page scroll stop at the top side of the text.
 10. The method of claim 5, wherein the display device is a touch screen of a client device. 